Utilizing prosody for unconstrained morpheme recognition

نویسندگان

Volker Strom

Henrik Heine

چکیده

Speech recognition systems for languages with a rich in ectional morphology (like German) su er from the limitations of a word{based full{form lexicon. Although the morphological and acoustical knowledge about words is coded implicitly within the lexicon entries (which are usually closely related to the orthography of the language at hand) this knowledge is usually not explicitly available for other tasks (e.g. detecting OOV words). This paper presents an HMM{based `word' recognizer that uses morphemes on the string level for recognizing spontaneous German conversational speech (Verbmobil corpus). The system has no explicit word knowledge but uses a morpheme{bigram to capture the German word and sentence structure to some extent. The morpheme recognizer is tightly coupled with a prosodic classi er in order to compensate for some of the additional ambiguity introduced by using morphemes instead of words. Although the recognizer's morpheme accuracy of 85:3% is comparable to that of our word{based decoder (word accuracy 86%) until now the bene t of introducing the prosodic classi er is not yet clear.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Large Vocabulary Continuous Speech Recognition for Estonian Using Morpheme Classes

This paper describes development of a large vocabulary continuous speaker independent speech recognition system for Estonian. Estonian is an agglutinative language and the number of different word forms is very large, in addition, the word order is relatively unconstrained. To achieve a good language coverage, we use pseudo-morphemes as basic units in a statistical trigram language model. To im...

متن کامل

Large Vocabulary Continuous Speech Recognition for Estonian Using Morphemes and Classes

متن کامل

Evidence Theory-Based Multimodal Emotion Recognition

Automatic recognition of human affective states is still a largely unexplored and challenging topic. Even more issues arise when dealing with variable quality of the inputs or aiming for real-time, unconstrained, and person independent scenarios. In this paper, we explore audio-visual multimodal emotion recognition. We present SAMMI, a framework designed to extract real-time emotion appraisals ...

متن کامل

Robust Iris Recognition in Unconstrained Environments

A biometric system provides automatic identification of an individual based on a unique feature or characteristic possessed by him/her. Iris recognition (IR) is known to be the most reliable and accurate biometric identification system. The iris recognition system (IRS) consists of an automatic segmentation mechanism which is based on the Hough transform (HT). This paper presents a robust IRS i...

متن کامل

Spoken Keyword Rescoring and Document Retrieval for Low-resource Languages

For languages that have adequate data for automatic speech recognition (ASR), many keyword search(KWS) and document retrieval(SDR) systems have been developed with near-optimal performance. However, lacking of sufficient training data to produce high accuracy transcript, identification and retrieval of queries in speech data from low-resources languages remains challenging. To compensate for th...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1999

Utilizing prosody for unconstrained morpheme recognition

نویسندگان

چکیده

منابع مشابه

Large Vocabulary Continuous Speech Recognition for Estonian Using Morpheme Classes

Large Vocabulary Continuous Speech Recognition for Estonian Using Morphemes and Classes

Evidence Theory-Based Multimodal Emotion Recognition

Robust Iris Recognition in Unconstrained Environments

Spoken Keyword Rescoring and Document Retrieval for Low-resource Languages

عنوان ژورنال:

اشتراک گذاری